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ABSTRACT 

The purposes of this research project are (1) to test 
a theory which purports to identify the x edictive antecedents (i.e., 
precursors) of developmental dyslexia (sptcific reading disability) 
several years before the disorder is clinically evident, and (2) to 
evaluate the mechanism which is postulated to underlie and influence 
later developmental changes in this disorder. The study focused on 
the predictive accuracy of a developmental-neuropsychological test 
battery which was given to a total population of white boys during 
kindergarten (1970) in order to forecast their reading achievement in 
subsequent years. The predictive accuracy of the test battery was 
determined by assessing reading in this population at the end of 
grades one, two, and three- The results revealed that performance on 
these tests during kindergarten was extremely predictive of the 
child's reading group membership in later grades, particularly with 
those children destined to become severely disabled or superior 
readers. These findings lend substantial support to the validity and 
utility of an early detection or warning system taat could be 
administered economically before the child begins formal reading and 
at a time when he is less subject to the effects of academic failure. 
(RB) 
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The present chapter is addressed to a longitudinal review (Years 2 and 3) 
and follow-up evaluation (Year 4,: of the predictive validity of a develop- 
mental neuropsychological test battery that was administered to an original 
population (N = 497) of white boys at the beginning of kindergarten in 1970 
in Alachua County, Florida. The children represented virtually all of the 
white male population (96%) enrolled in the 20 schools (14 urban, 6 rural) in 
1970. In addition, the chapter is addressed to a three year cross-validation 
follow-up on an additional sample of 181 white boys who were tested at the 
beginning of kindergarten in 1971 with the same battery to forecast reading 
achievement at the end of Grade 2 (1974). 

The purposes of this research project are twofold: (1) to test a theory 
(Satz and Sparrow, 1970; Satz, Rardin and Ross, 1971; Satz and Van Nostrand, 
1973) which purports to identify the predictive antecedents (i.e., precursors) 
of developmental dyslexia (specific reading disability) several years before 
the disorder is clinically evident and (2) to evaluate the mechanism which is 
postulated to underlie and influence later developmental changes in this dis- 
order. 

The present chapter is addressed to the first objective (i.e., early de- 
tection) because not enough time has elapsed in the longitudinal project (four 
years) in order to investigate the developmental course of this disorder after 
the reading disability has been diagnosed and confirmed. However, preliminary 
data on developmental changes within the first three years (K-G2) are reported 
in a recent paper (Satz, Friel and Rudegeair, 1974). 

The present research springs from the need for an early and valid detec- 
tion or 'warning system 1 that could be administered before the child begins 
formal reading --at a time when his central nervous system may be more plastic 
and responsive to change --and at a time when he is less subject to the 
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shattering effects of repeated academic failure. The need for early detection 
research in the field of childhood reading disorders has long been recognized 
but has been marked by a paucity of well-con; rolled longterm longitudinal 
.studies (Critchley, 1968; Money, 1962; Eisenbe>-, 1966; Gallagher and Bradley, 
1972: Kline, 1962; Ames, 1968; Satz, 1973). The advantages of an early identi- 
fication and remediation program are highlighted by the recent survey by 
Keeney and Keeney (1968), It was shown that « , .when the diagnosis of dys- 
lexia was made in the first two grades of school nearly 82 percent of the 
students could be brought up to their normal classroom work, while only 46 
percent of the dyslexic problems identified in the third grade were remediated 
and only 10 to 15 percent of those observed in grades five to seven could be 
helped when the diagnosis of learning problems was made at those grade levels" 
(Strag, 1972, p. 52). 

Experimental studies with humans and infrahumans also suggest that the 
child may be more responsive to environmental stimulation (e.g., remediation) 
during that period in which the brain is maturing and when behavior is less 
differentiated (Caldwell, 1968). Bloom (1964) has shown that variations in 
the environment have their greatest quantitative effect on a characteristic 
(e.g., speech) at its most rapid period of change (i.e., ages 2-10) and the 
least effect at the least rapid period of change (i.e., ages 11-15). Infra- 
human studies also suggest that organization can be strongly modified only 
when active processes ojc organization are underway and that when facilitated, 
they progressively inhibit attempts at reorganization (Scott, 1968). 

Despite the obvious advantages for an early intervention thrust, they 
must be based on a valid end efficient detection procedure, especially when 
the detection measures are applied during preschool and before formal read- 
ing instruction is commenced. Within such a prevention context, certain types 
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of prediction errors are crucial to the utility of an 1 early warning 1 system-- 
e.g., the false positive and negative signs. To initiate an intervention 
program for test-classified high risk children in kindergarten may be 
fruitless if the majority of true high risk children are missed by the tests 
(i.e., false negatives). Equally serious and perhaps more risky is the case 
where an intervention program ic based on erroneously test-classified posi- 
tives (high risk* who in three years would have become average to superior 
readers without remediation (i.e., false positives). These prediction errors 
can occur with tests despite an apparent demonstration of validity via more 
simplified descriptive univariate tests of significance (e.g., t-test, Chi- 
square). In other words, the experimental validation of an early detection 
battery must incorporate a multivariate design in which multiple measure- 
ments are made on the same subjects over time with sufficient temporal 
separation between the initial test probe (e.g., kindergarten) and the 
criterion reading assessment in later years (e.g., Grades 1-3). This type 
of design should utilize a longitudinal framework based preferably on a total 
population rather than smaller sample of children to offset the potential 
attrition effects over time and to provide more reliable base rate estimates, 
of reading disability in the (fesignated population. Further, the selection of 
a more homogeneous population (e.g., white boys) provides an opportunity 
to obtain a higher at-risk group for later reading disability (i*e., 
dyslexia) without confounding sex, race or cultural variables (Bentzen, 1963; 
Satz, 1973) . A final methodological requirement, essential to the evaluation 
of an early detection battery, concerns the use of a separate group of children 
upon which to cross-validate the predictive validity of the tests administered 
to the standardization population- -and for the same interval of time between 
the initial probe and the reading criterion assessment. 
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Failure to incorporate the preceding methodological factors into an 
early detection study would surely limit if not invalidate the results. A 
review of the current early detection literature reveals an alarming disregard 
of these problems and a scathing indictment of the area (Gallagher and Bradley, 
1972). 

An equally serious problem in the early detection literature concerns the 
lack of a theoretical framework in which to conceptualize the nature of the 
disorder—namely dyslexia—and its antecedent precursors. Without a testable 
theory one is bound to be restricted in the selection of a test battery which 
purports to identify the potentially high risk child. Although a theory re- 
presents a framework in which to organize diverse sets of data--often seeming- 
ly unrelated—it 1 s ultimate validation must rest on empirical verification 
over time. In the present context, a theory which postulates the precursors 
of developmental dyslexia, before the child begins formal reading, can only 
be evaluated empirically within a longitudinal framework. Further, if support 
for the theory can be replicated and cross validated in later years, then in- 
formation is available for application (i.e., early intervention) and for fur- 
ther understanding of this complex disorder of childhood (i.e., etiology and 
mechanism) . 

The preceding theoretical rationale reflects some of the basic assumptions 
that underlie all behavioral research (Lindgren and Byrne, 1971, pp. 18-19): 

1. All behavior is caused— that is, it is deter- 
mined by and is the necessary consequence of 
antecedent events. 

2. The causes of behavior are multiple. 

3. The causal factors leading to the variance in 
the behavior being studied must be identified, 
if valid prinicples or generalizations arc to 
be developed. 

4. The principles or generalizations that govern 
causal factors or account for variations in 
behavior are simpler than the original data 
on which they are based. 



BEST COPY AVAILABLE 



page 5 

5. The test of whether prinicples or general- 
izations are valid is whether they can be 
used to predict behavior. 

Theory . The theory which provides the methodological and conceptual 
framework for the current longitudinal research has been discussed in pre- 
vious papers (Satz and Sparrow, 1970; Satz, Rardin and Ross, 1971; Satz and 
Van Nostrand, 1973). Briefly, it postulates that developmental dyslexia 
reflects a lag in the maturation of the brain which delays differentially 
those skills which are in primary ascendancy at different chronological ages. 
Consequently, those skill? which during childhood develop ontogenetically 
earlier (e.g., visual-perceptual and cross-modal sensory integration) are 
more likely to be delayed in younger children who are maturationally imma- 
ture. Conversely, those skills which during childhood have a later or slower 
rate of development (e.g., language and formal operations) are more likely to 
be delayed in older children who are maturationally immature. 

The theory is compatible with those developmental positions which pos- 
tulate that the child goes through consecutive stages of thought during devel- 
opment, each of which incorporates the processes of the preceding stage into 
a more complex and hierarchically integrated form of adaptation (Hunt, 1961; 
Plaget, 1926; Bruner, 1968). Thus, it is predicted that those children, 
durine preschool, who are delayed developmentally in skills which are in pri- 
mary ascendancy at this stage, will eventually fail in acquiring reading pro- 
ficiency. It is predicted, however, that these children will eventually 
"catch up" on these earlier developing skills but will then lag on those more 
cognitive- linguistic skills which have a slower and later ontogenetic develop- 
ment (Thurstone, 1955; Bloom, 1964). In other words, the theory predicts that 
the nature of the disorder will vary as a function of the chrongological age 
of the child. 
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More specifically, the lag in brain maturation is postulated to delay 
the acquisition of those developmental skills which have been shown to be 
crucial to the early phases of reading- -namely , learning to differentiate 
graphic symbols (Gibson, 1968) or the perceptual discrimination of letters 
(Luria, 1966). Both authors recognize an orderly and developmental sequence 
in which the early phases of reading are characterized by processes of percept- 
ual discrimination and analysis. In this early phase the child must discrim- 
inate the distinctive features of letters (e.g., break vs. close, line vs. 
curve, rotation and reversal) before he can proceed to later phases which 
require more complex phonetic and linguistic analysis. Smith (1971) also 
recognizes the importance of learning the distinctive features of written 
language in the beginning phases of reading, but cautions that fixation at 
this level will retard the syntactic process of fluent reading. 

The theory, in su*. »ary, conceptualizes developmental dyslexia as more 
than a reading disorder per se . That is, the disorder is explained as a 
delay in those crucial early sensori-perceptual and later conceptual-linguis- 
tic skills which are intrinsic to the acquisition of reading and which are 
triggered by a lag in the maturation of the cerebral cortex. (The lag mech- 
anism, being unobservable at the present time, is treated as a hypothetical 
construct.) In other words, dyslexia (specific reading disability)is seen as 
a disorder in central processing, the nature of which varies with the chrono- 
logical age of the child. This delay in central processing is not meant to 
imply damage, loss of function or impairment. Such terms are more compatiable 
with a disease model which often implies a static developmental-acquisition 
course; With respect to early detection of high risk children, the theory pre- 
dicts that delays in those developmental skills which are in primary ascendancy 
during preschool (i.e., kindergarten) are more likely to forecast later prob- 
lems in reading by Grades 2 and 3. 

8 
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Current Objectiv es . , The following section is divided into two parts, 

the first of which reviews the second and third year predictive follow-up of 

those children (N = 497) who were tested at the beginning of kindergarten in 

Alachua County (1970) and for whom independent criterion reading measures were 

obtained at the end of Grades 1 (1972) and Grades 2 (1973). The results of 

these two follow-up studies are reported elsewhere in detail (Satz and Friel, 

2 

1974; Satz, Friel and Goebel, in press). The second part of this section is 
addressed briefly to two recent unpublished studies: the first subpart con- 
cerns the fourth year predictive follow-up of this original population of 
boys (N = 497) for whom independent reading measures were obtained at the end 
of Grade 3 (1974); the second subpart concerns a three year cross-validntion 
of the weights derived from the standardization population (1970, N - 497) to 
an additional sample of 181 white boys who were tested at the beginning of 
kindergarten (1971) with the same battery to forecast reading achievement at 
the end of Grade 2 (1974). 

REVIEW STUDIES ( Part I ) , 

3 

Secon d Year Follow-up ( 1970 - 1972 ) . 

During the spring term of 1972 (Grade 1) independent reading measures 
were obtained for 957* of the boys who were tested at the beginning of kinder- 
garten in 1970 (N = 473). This figure represents an attrition rate of only 
57o during the two year interval. The re^' j measure, which was completed 
by the individual classroom teachers (within and outside Alachua County), was 
adapted from the standard grade report for Grades 1-3 of Alachua County, 
Florida. On this basis, children were classified into four dichotomous read* 
ing groups which can be seen in the column totals of Table Is Severe (N « 18), 
Mild (N = 55), Average (N = 339) and Superior (N = 61). These frequencies in- 
dicate a 157o incidence of reading disability at the end of Grade 1 (73/473) 

9 
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with an incidence of only 4% in che Severe group (18/473). These incidence 
figures, which were based on preliminary reading assessment in our population 
(Grade 1, 1971) were later shown to be premature when assessed at the ends of 
Grades 2 and 3 (see following sections). 

insert Table 1 about here 

Despite the tentative nature of this reading measure, the developmental 
tests given in 1970 (n = 19) correctly predicted 84.4% of the children into 
four discrete reading groups (Satz and Friel, 1974). These results, based on 
a multiple discriminant function analysis (4 group), can be visualized in 
Table 1 by comparing the test predictions (+, -) in the rows (1970) again,;t 
the criterion outcomes (n = 4) in the columns (1972). For valid positives 
the tests correctly predicted all of the 18 Severe cases (Vp = 1007o) and 39 
of the 55 Mild cases (Vp = 71%), For valid negatives, the tests correctly 
predicted 284 of the 339 Average readers (V n = 8470 and 58 of the 61 Superior 
readers (V n = 95Z) . In other words, the predictive accuracy was largely con- 
fined to the extreme reading groups with overlap error largely confined to the 
Mildly Disabled and Average reading groups. In terms of overall hit-rate, the 

tests correctly predicted 399 (18 + 39 + 284 + 58) of the 473 Ss or 84.4% of 

4 

the standardization population, 

insert Table 2 about here 

Predictive ranking of tests , A stepwise regression analysis was then 
computed to determine the ranking of the predictor variables in terms of 
their criterion discrimination. Table 2 presents the ranking of the most 
accurate variables along with their cumulative hit frequencies and factorial 
loadings* Inspection of this table reveals that the Finger Localization Test 
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ranked highest (76%) followed cumulatively by the Recognition - Discriminatim 
Test (77%), Day. of Testing (79%) and the Alphabet Recitation Test (82%). The 
remaining variables contributed an additional increment of less than three 
percent to the total hit-rate of 84.4%. This table (Table 2) also shows that 
three of the most discriminating variables loaded on Factor I, which has pre- 
viously been defined . as a general measure of sensori-perceptual-motor- 
mnemonic ability (Satz and Friel, 1973). 5 In the original factor analysis, 
four factors emerged (principal axis solution and orthogonal rotation to 
varimax criterion): Factor I - tests of sensori-perceptual-motor-tnnemonic 
ability (30.7% common variance); Factor II - kindergarten teacher evaluations 
(16% common variance): Factor III - tests of conceptual- verbal ability (13.4% 
common variance) and Factor IV - tests of motor dominance and laterality (7.77. 
common variance). Factor I is felt to tap those skills which are in primary 
ascendancy during preschool years (i.e., kindergarten). As such, the results 
were felt to lend preliminary support for those developmental precursors 
postulated to underlie and forecast subsequent reading achievement (Satz and 
Van Nostrand, 1973). When the means and standard deviations were examined 
for each of the kindergarten measures (1970) it was observed that those child- 
ren destined to reading problems at the end of Grade 1 (1972) were lagging 
behind the to-be good readers on most of the developmental tests--but par- 
ticularly on the Factor I tests. In fact, on the Beery Developmental Test of 
Visual-Motor Integration (Factor I), the High Risk reading group ( Severe and 
Mild ) revealed almost a 12 month lag between their chronological age (65.8 
months) and performance age (54.4 months). By contrast, the Low Risk reading 
group (Average and Sup erior ) virtually matched their chronological and per- 
formance age on this test (66.4 months vs. 66.5 mouths, respectively). 
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Third Year Follow-up ( 1970 - 1973 ). 

Despite the encouraging preliminary findings in the two year follow-up 
(Satz and I'riei, 1974) the interval of an additional year provided an oppor- 
tunity to obtain more objective and stable measures of reading achievement in 
this population at the end of Grade 2 in 1973 (Satz, Friel and Goebel, in press), It 
was also felt that the availability of wore objective reading measures by 
Grade 2 (spring) would provide more valid estimates on the incidence of read- 
ing problems in this population (white boys) and a comparison of the differ- 
ential incidence of reading disability, if any, between Grades 1 and 2< Fur- 
thermore, with more oojective reading measures, Independently assessed, it 
was felt that a more rigorous evaluation of the predictive validity of the 
kindergarten tests could be made (K-G2) , including their, discriminative rank- 
ing. 

With this objective in mind, two different methods of reading assessment 
were obtained at the end of Grade 2 (1973). ^The first method was again based 
on the Glassroom Reading Level which was filled out by the individual teachers 
both within and outside of Alachua County, Florida. Reading forms were re- 
turned for 458 of the original 497 Ss which represents 927 0 of the population 
three years later (attrition = 8%). Subjects whose reading was assessed at 
Levels 0-4 (Nj Readiness through Primer) were designated as the Severely Dis - 
abled readers. Those assigned Level 5 (Firsc Reader) comprised the Mildly 
Disabled reading group. Subjects reading at Levels 6 and 7,. (Second Readers) 
were designated as Average readers, and those reading above that level com- 
prised the the Superior reading group. 

The second criterion reading measure was based on Classroom Reading 
Level and the IOTA. This combined reading measure afforded the advantage of 
incorporating both the teacher's assessment of reading, based on nearly a 
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year's interaction with the child, and an independent, individually adminis- 
tered, objective reading test. Both of these measures were obtained on 419 
of the original population (85%).^ After converting the raw scores on each 
measure to T-scores (X - 50, s « 10), the distribution was dichotomized again 
into four reading groups ( Severe , Mild , Average , and Superior ) . 

Criterion I Analysis . The first four-group discriminant function analysis 
(Program DSCRIM) was computed on Classroom Reading Level in order to compare 
the predictive validity of the tests (1970) against a similar type of reading 
criterion at the end of Grade 2 (1973). The results of this analysis can be 
seen in Table 3 by comparing the test prediction (+,-) in the rows (1970) 
against the criterion outcomes (n = 4) in the columns (1973). Before dis- 
cussing the predictive hits, it can be seen that the incidence of reading 
disability (columns) increased substantially between Grades land 2 (1972- 
1973): revere (N = 54), Mild (N = 66), Average (N = 270), Superior (N » 68). 
This criterion distribution reveals a 26% overall incidence of reading dis- 
ability [(54 + 66V458)J and an incidence of 12% in the Severe group (54/458). 
Compared to Grade 2 (1972), this represents a three-fold increase in the 
frequency of Severe cases (1973). 

To further assess the validity of this criterion measure (i.e., Class- 
room Reading Level), scores on the Gates-McGinitie, which was administered 
to the total population of children in Grade 2 (Spring) in Alachua County, 
were averaged for each of the reading groups. Vocabulary recognition scores 
on this test were compatible with the results of Classroom Reading Level 
^Severe (-12.3 mos.), Mild (-6*5 mos.), Average (+7*9 mos*) and Superior 
(+18.8 mos.)]. In other words, the Severe group was well over a full grade 
level behind in reading (where academic year » 10 mos.). In contrast, the 
Superior group was almost two years ahead in terms of grade level and month 
of testing. Moreover, the Mild group was only six months behind grade level 
whereas t-.p Average reading group was approximately eight months ahead. 
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insert Table 3 about here 

Table 3 also reveals the predictive hits. For valid positives, the tests 
correctly predicted 48 of the 54 Severe cases, (Vp 551 89%) and 47 of the 66 
Mild cases (Vp * 7V/.), For valid negatives, the tests correctly predicted 
196 of the 270 Average readers (V n * 73%) and 64 of the 68 Superior readers 
(V = 94%), Once again, the predictive accuracy of the tests was largely 
confined to the extreme reading groups with overlap error largely confined 
to the Mildly Disabled and Average reading groups. In terms of overall 
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hit-rate, the tests correctly predicted 355 (48 + 47 + 196 + 64) of the 
458 SwS or 78% of the population, 

insert Table 4 about here 

Predictive ranking of t ests , A stepwise regression analysis was then 
computed to determine the ranking of the predictor variables in terms of 
their criterion discrimination. Table 4 presents the ranking of the most 
accurate variables along with their cumulative hit frequencies and factorial 
loadings. Inspection of this table reveals that the Finger Localization Test 
ranked highest (71%) followed cumulatively by the Alphabet Recitation Test 
(76%), Recognition - Discrimination Test (77%) and Day-of- Testlng (77%). The 
remaining tests contributed an additional increment of less than one percent 
to the total hit- rate of 78%. This table (Table 4) again shows the same 
four variables that ranked highest in the two year follow-up (Grade 1, 1972). 
three of which, in both studies, loaded on Factor I (Table 2). This finding 
again strengthens the validity of sensori-perceptual-motor-mnemonic abilities 
in forecasting subsequent reading achievement levels. 

insert Table 5 about here 

Criteri on II Analysis . This analysis was based on the predictive 
accuracy of the tests (n » 19) given at the beginning of kindergarten (1970) 
to the combined criterion of Classroom Reading Level plus IOTA Word Recogni- 
tion at the end of Grade 2 (1973). These result; can be seen in Table 5 by 
comparing the test predictions (+, -) in the rows (1970) against the criterion 
outcomes (n = 4) in the columns (1973). First, the criterion outcomes in 
the columns revealed an even higher incidence of reading disability: Severe 
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(N * 67), Mild (N * 77), Average (N » 214), and Superior (N « 61) . Although 
based on a smaller number of Ss, this criterion assessment revealed a 347o 
overall incidence of reading disability [(67 + 77V419) and an incidence 
of 16% in the Severe group (67/419) --which is four times higher than was 
observed at the eod of firade 1 (1972). 7 

The validity of this combined reading criterion was further assessed 
by computing the mean scores of the Gates-McGinitie for each of the four 
reading groups. Vocabulary recognition scores on the Gates were in 
essential agreement with the reading groups based on Classroom Reading 
Level and the IOTA f Severe (-12.1 mos.), Mild (-10.7 mos.), Average 
(+3.3 mos.) and Superior (+16.3 mos.)] . In other words, the Severe 
group was over a full grade level behind in reading and the Superior 
group was over a year and a half ahead in terms of grade level and 
month of testing. Mo reefer, the Average reading group was essentially at 
grade level. 

The predictive hits can also be seen in Table 5. For valid positives, 
the tests correctly predicted 61 of the 67 Severe cases (V p * 91%) and 51 
of the 77 Mild cases (V p = 66%). For valid negatives, the tests correctly 
predicted 146 of the 214 Average readers (V n » 68%) and 59 of the 61 
Superior readers(V n » 97%). Once again, the total hits (61 + 51 + 146 + 
59 = 317/419 or 76%) were largely confined to the extreme reading groups* 

Toe 

Ifee purposes of brevity, the stepwise regression analysis also revealed 
the same discriminative ranking of tests that was found in the second and 
third year follow-up studies using Classroom Reading Level as the criterion* 
The Finder Localization Test ranked highest followed cumulatively by the 
Alphab et R ecitation Test, Recognition - Discrimination Test and Day - of - Testing * 
Three of these four variables each loaded significantly (r 5*37) on Factor I. 

° 1 (ft 
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Two additional findings emerged from this third year follow-up that 
merit attention for educators. First, it was found that an abbreviated sample 
of the L9 tests yielded virtually the same predictive accuracy as the full 
battery* Selection of this abbreviated battery (seven test and one nontest 
variables) was based on empirical evaluation of the highest ranking tests 

Q 

in each of the stepwise regression analyses across the years. After 
separate discriminant function (Program DSCRIM) and stepwise regression 
analyses were computed against the combined reading criterion (Grade 2, 
1973) , it was found that eight of the tests given in 1970 detected almost 
all of those children destined to extremes in the reading distribution at 
the end of Grade 
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2 (i.e., Severe and Superior groups). Also, the same three Factor I testa 
again ranked highest ( Finger Localization . Alphabet Recitation , and Keccgni - 
tion- Discritnt nation) . This finding, which minimizes both time and cost 
factors associated with test administration, without sacrificing accuracy, 
greatly increases its applicability for educators concerned with large scale 
early screening assessment. 

The second finding, of possible interest to educators, concerned the 
utility of this test battery (abbreviated or standard) to provide a decisional 
basis upon which to initiate or withhold treatment for an individual child 
based on his test scores at the beginning of kindergarten. This decisional 
process, which was prompted by the consistently high predictive accuracy 
in the extreme reading groups across years, is determined by computing 
the conditional probability of the differential test signs (+, -). These 
conditional probability values are based on the inverse probablities of the 
test signs (valid and false positives and valid and false negatives) and 
the base rates of reading competency in this population (Meehl and Rosen, 
1955; Satz, Fennell and Reilly, 1970). To compute the conditional probabili- 
ties for each of the test signs, the (+) and (-) rows in Tablo 5 were sub- 
divided (on the basis of the original four group DSCRIM analysis) to generate 
four levels of test decisions g++) , (+),(-), (--J . Table 6 reveals this 
4X4 contingency table in which test signs (and decisions) are represented 
by rows and the outcomes are represented by columns. This table thus allows 
the educator to determine the probability that a given child is destined to 
reading disability or competency, in three years, given that his test scores 
fall in the (4+) , (+) , (- ) , (--) range. As such, it also provides him 
with the likelihood that intervention should be instituted or withheld. 
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insert Table 6 about here 

The results of this analysis which can be seen in Table 6, indicate that 
with a severe high risk composite sign (++) , the decisLon to initiate 
treatment would have been correct in 82% of the cases; further, this treat- 
ment decision would have included 50 of the 67 potential Severe cases (75%) 
and none of the potential Superior readers in the intervention program. By 
contrast, the decision to intitiate treatment, given a less severe high risk 
composite sign (+) , would have been correct only 44% of the time. In other 
words, extension of treatment services to this group would have yielded only 
11 more of the potential Severe cases and 33 of the potential Mild cases, but 
it would also have involved treatment for 55 children who would not have 
needed it (false positives). Thus, initiation of treatment for this com- 
posite test sign (+) would have resulted in treating more children who didn't 
need it than did* 

This table also shows that the decision to withhold treatment (NT) would 
have been correct 77% of the time given a mild low risk sign (-) and 96% of 
the time given a very low risk sign (--)• The only risk, given a mild low 
rick sign (-) is that treatment would have been withheld for a small number of 
potentially Severe (N = 5) and Mild (N - 23) reading cases. This decision ■ 
risk, however, would have been virtually eliminated by adopting a more con- 
servative no treatment policy based on very low risk signs (--) . 

The latter findings, in summary, illustrate the potential usefulness of 
a detection procedure which, while brief and economical to administer, can 
also generate the conditional outcome probabilities (i«e«, reading level) for 
an individual child during kindergart*^ u. Within this framework, educators 
could base their treatment strategies on a number of factors including the 
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incidence of severe high risk children in the school, available resources, 
and the risks associated with intervention (false positive errors) vs. non- 
intervention (false negative errors). If treatment resources were limited 
and the number of high risk children was large, then surely a conservative 
decisional strategy should be adopted for treatment Q..e., (++) only] . With 
respect to the present study, this treatment strategy would have been correct 
for 827, of the cases; moreover, it would have detected or included 75% of the 
potential Severe cases in the treatment program and none of the potential 
Superior readers! 

On the other hand, if multiple treatment resources were available, ed- 
ucators might choose to adopt a more liberal intervention policy, given the 
present utility table i.e., £(++) and (+)3 • While this strategy would pick 
up most of the potential Severe cases, it would also entail considerable 
efforts and time for many children who would not have needed such help (i.e., 
false positives). But then again, these latter children, through intervention, 
may have eventually become superior readers. 

It should be apparent that the final decision to initiate treatment or 
not should be made by the educator who has to consider a multiplicity of 
factors. The utility table presented in this section merely simplifies this 
decisional task by generating the likelihood probabilities and risks for 
either decision (T or NT) in each individual case. 

These outcome probabilities, while generated by the test signs and base 
rate incidence of reading disability in the population, are of little use to 
educators if the predictive validity of the tests cannot be replicated across 
years within Ss or cross-validated across years on additional J3s. The follow- 
ing section is addressed to this problem. 
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NEW STUD IKS (Part II) 

Fourth Year Follow- up ( 1970 - 1974) . 

In view of the high predictive accuracy of the developmental tests in 
the second and third year follow-up studies (Grade 1, 1972 and Grade 2, 1973, 
respectively), the question was raised as to whether the accuracy would 
attenuate as the test-criterion interval increased to four years. After all, 
an interval of this length would seemingly increase the number of uncontrolled 
environmental, growth and treatment factors which could produce changes in 
criterion group membership and thus lower the predictive accuracy of the tests 
given in 1970. In fact, by the end of Grade 3 (Year 4, 1974), it was learned 
that the vast majority of the Severe cases were receiving remedial help in the 
schools. With this knowledge it also became necessary to determine whether 
the incidence of reading problems decreased between the third and fourth 
follow-up years and if not, to determine whether the same £>s again fell in 
the same criterion groups. 

insert Table 7 about here 

Classroom Reading Level was again used to assign Ss into the four dis- 
crete reading criterion groups at the end of Grade 3 <1974). These reading 
forms, which were independently filled out by the individual classroom 
teachers, were completed for 459 of the original 497 Ss (1970). This figure 

represents almost 93% of the original population now residing in Florida, the 

9 

United States and abroad. The obtained reading distribution can bj seen in 
the column totals of Table 7: Severe (N = 55), Mild (N = 93), Averag e (N = 23' r ) 
and Superior (N « 77). This distribution reveals a 32% overall incidence of 
reading disability [(55 + 93)/459] and an incidence of 12% in the Sever e group 
(55/459). These frequencies can be more easily visualized in Figure 1 by 
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comparing the longitudinal changes in the incidence of reading disability 
(based on Classroom Reading Level) from the end of kindergarten (1971) to the 
end of Grade 3 (1974), Inspection of this figure reveals an increasing 
incidence of overall reading disability from Grades 1-3 but with a leveling 
off in the Severe reading group betweer the ends of Grades 2 and 3 (1973- 
1974), In other words, the increase in overall reading disability in the last 
follow-up year (Grades 2-3) was due to an additional increase (N = 27) in the 
number of Mild reading cases * It should be kept in mind that this increasing 
incidence of reading problems across years has occurred despite the equally 
increasing intervention of the schools to provide remediation for these dis- 
abled readers ( Severe and Mild ) * 

insert figure 1 about here 

Of particular interest are the changes in reading disability between the 
ends of Crades 2 and 3 (1973-1974)* These changes can be visualized by com- 
paring the column totals in Tables 3 and 7* The totals in the Sever e group 
remained essentially unchanged (Grade 2, N = 54; Grade 2, N = 55) • However, 
there were S changes within groups* Of the 54 Ss in the Severe group (Grade 
2) , 38 remained in Grade 3. Of the 16 Ss who changed from the Severe group, 
10 were reassigned to the Mild group in Grade 3 and six to the Average read- 
ing group* Despite these changes in the Severe group between Grades 2 and 3, 
there were an additional 17 Ss who were reassigned to the Severe group in Grade 3 
from the remaining groups in Grade 2 (1973)* Further, 13 of these 17 Ss came 
from the Mild group and four from the Average reading group. 

Figure 1 also shows that there was a six percent increase in the incidence 
of overall reading disability between the ends of Grade 2 and 3 (1973-1974). 
This increase was due to :he reassignment of 27 additional cases to the Mild 
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group (Grade 3), 10 of whom came from the Severe group (Grade 2) and 17 of 
whom came from the Average reading group (Grade 2). 

The next question was whether this reassignment of Ss into different 
criterion groups (Grades 2-3), plus the uncontrolled treatment intervention 
in the schools, would lower the predictive accuracy of the tests given in 1970 
(n = 19), To aiu'w* this question, a four-group discriminant function analysis 
(Program DSCRIM) was again computed on the test scores (1970) against the 
criterion reading membership (1974) . These results can be seen in Table 7 by 
comparing the test predictions (+,-) in the rows (1970) against the criterion 
outcomes in the columns (1974). For valid positives, the terts correctly pre- 
dicted 50 of the 55 Severe cases (V p * 91%) and 64 of the 93 Mild cases 
(V p « 697u) „ For valid negatives, the tests correctly predicted 163 of the 
234 Av erage readers (V n = 707 o ) and 72 of the 77 Superior readers (V n = 94%) • 
Or.ce again, the predictive accuracy of the tests was high, particular in the 
more important extreme groups, despite an interval of almost four years fjrom 
te k <t probe to criterion determination. In terms of overall hit-rate, the 

tests correctly predicLed 349 (50 + 64 + 163 + 72) of the 459 Ss or 767o of 
10 

the population. This hit-rate is almost identical to the results of the 

third year follow-up (Grade 2, 1973). Of particular importance is the fact 

that the tests continued to show greater accuracy for those children de&Uted 

to extremes of the reading distribution in later years. It is these children 

that educators must identify, hopefully during primary grades, in order to 

institute more effective programs for future growth and development. In this 

context, the potentially gifted child should be given the same consideration 

as the high risk child. To stifle creativity or achievement in a gifted child 

by improper placement or lack of stimulation is almost as negligent as the 

failure to identify and help the immature child at a critical time in his 
delayed development. 

2.1 
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Predictive ranking of tests , A stepwise regression analysis was also 

computed to determine the ranking of the predictor variables in terms of their 

criterion discrimination. Table 8 presents the ranking of the most accurate 

variables along with their cumulative hit frequencies and factorial loadings* 

Inspection of this table reveals that the Finger Localization Test ranked 

highest (707o) followed cumulatively by the Alphabet Recitation Test (74%), 

Recognition - Discrimination Test (77%) and Dlchotlc - Llstening Test ( Total 

Recall ) {11%). The remaining tests contributed an additional increment of 

11 

less than two percent to the total hit-rate of 78.2%, This table (Table 8) 
again shows that three of the same tests ranked highest in predictive discrim- 
ination across each of the follow-up years. Further, that each of these tests 
have loaded on Factor I. In the present study, all of the four top ranking 
tests loaded on Factor I which again buttresses the significance of this fac- 
tor in forecasting subsequent reading achievement, 

insert Table 8 about here 

C ross - Valldatlon : Third Year Follow-up (1971- 1974 ) 

The following section summarizes the results of the final and major eval- 
uation of the predictive test battery given at the beginning of kindergarten 
in 1970. Despite the high predictive accuracy which was demonstrated across 
each of the follow-up years (Grades 1-3), for both the standard and abbrevi- 
ated test batteries, the results must be interpreted as only tentative until 
replicated on an independent group of children with the same time interval 
between test probe and criterion determination. This methodological require- 
ment must be met notwithstanding the high predictive validity and internal 
consistency (predictive ranking) of the tests across follow-up years. 
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In the present study, it was decided to cross-validate the test weights 
(lambdas) derived from the third year follow-up of the standardization popula- 
tion (Grade 2) against a new sample of children who were tested at the begin- 
ning of kindergarten in 1971 and whose criterion reading scores were obtained 
three years later at the end of Grade 2 (1974). 

This cross-validation sample included 181 Ss who represented the total 
population of white kindergarten boys enrolled in five of the largest elemen- 
tary schools in Alachua County (based on the 1970 census). They were individ- 
ually tested, under identical conditions to the standardization population in 
1970, in a mobile testing laboratory with four assessment modules. The only 
difference in this new sample was the fact that the children were all selected 
from urban schools. 

This additional sample was follwed for three years until they reached the 
end of Grade 2 (1974) at which time reading assessments were made. For com- 
parison purposes to the third year follow-up study (standardization group), 
two types of reading assessment were again employed: Criterion I (Classroom 
Reading Level) and Criterion II (Classroom Reading Level plus IOTA Word Rec- 
ognition) . 

Criterion L analysis . This analysis was based on the test scores of the 
cross-validation group during kindergarten (1971), multiplied by the individ- 
ual lambda weights derived from the standardization discriminant function 
analysis (Grade 2), against the criterion of Classroom Reading Level assessed 
at the end of Grade 2 for the cross-validation group (1974). Reading measures 
were obtained on 175 of the original 181 Ss or 96% of the sample. The result- 
ing distribution of reading groups can be seen in the column totals of Table 9: 
S evere (N = 18), Mild (N = 25), Average (N = 92) and Superior (N = 40). This 
distribution reveals a 25% overall incidence of reading disability f(18 + 25/175) 
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and an incidence of 10.37 u in the Severe group (18/175) . These percentages 
are almost identical to the figures obtained with the standardization popu- 
lation at the end of Grade 2 (1973) [Reference Table 3 and Figure l] . 



insert Table 9 about here 



The predictive nits of the abbreviated test battery (n « 8) can be seen 

in Table 9. For valid positives, the tests correctly predicted 16 of the 18 

S evere cases (V = 89%) but only 9 of the 25 Mild cases (V = 36%). However, 
p p 

for valid negatives, the tests correctly predicted 64 of the 92 Average read- 
ers (V n * 70%) and 37 of the 40 Super-* or readers (V n * 93%). These detection 
outcomes are virtually the same as with the standardization population fcr the 
second, third and fourth follow-up years (Grades 1-3). That is, the tests 
were again more predictive of the extreme groups in the reading distribution 
(i.e., Severe and Superior ) . In terms of overall hit-rate, the tests correctly 
predicted 126 (16 + 9 + 64 + 37) of the 175 Ss or 72% of the sample. This 
predictive accuracy, based on weights derived from a different population 
and with a test-criterion interval of nearly three years, lends convincing 
support for the intrinsic validity of the tests. In fact, the overall hit- 
rate was even higher when the full test battery was employed (Hx « 76%); 
however, the full battery failed to detect as many severe high risk children, 
la other words, its false negative rate was higher than its false positive rate. 



insert Table 10 about here 
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Predictive ranking of tests . A stepwise regression analysis was computed 
on the full battery to see if the discriminative ranking of the tests remained 
essentially the same for this cross-validation sample of children. The 
results can be seen in Table 10. The Finger Localization Test ranked 
highest (66.9%) followed 
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cumulatively by the Embedded K^gum Test (68.27„), the WISC Similarities 
Subtest (70.97,) and rhe Finger Tap ping Test (Total ) (71.5%). The remaining 
tests contributed an additional increment of less than six percent to the 
total hit-rate of 77.57o (using uncross-validated weights). It is interesting 
to note that the Finger L ocalization Test again ranked highest in terras of 
criterion discrimination; however, the following rankings varied from the 
standardization studies in terms of tests and factorial loadings. Although 

♦ 

the two top ranking tests were again associated with Factor I, the next two 
were associated with Factor III ( Similarities ) and Factor IV ( Finger Tapping 
Test). This change in discriminative ranking must reflect subtle differences 
in the composition of the cross-validation group as compared to the standard- 
ization population. One difference already mentioned was the selection bias 
in favor of urban schools. A second difference might relate to the expected 
discrepancy between a sample (N = 181) and a population (N = 497). Regardless 
of the reason for this difference, it does suggest that the differential dis- 
criminability of the tests will vary, impart, as a function of the population 
or sample selected. However, if the battery consists of at least eight tests, 
the overall hit-rate should not be affected. 



insert Table 11 about here 
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Criteri on LI analysis . This analysis was based on the test scores of the 
cross-valiJation group during kindergarten (1971), multiplied by the individ- 
ual lambda weights derived from the standardization discriminant function 
analysis (Grade 2), against the combined criterion of Classroom Reading Level 
and IOTA Word Recognition assessed at the end of Grade 2 for the cross-valid- 
ation group (1974). The IOTA was individually administered to those children 
who still resided in Florida and Georgia by the end of Grade 2. For this 
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reason, criterion information was available, on only 151 of the original 181 
Ss (837). Inspection of the column totals in Table 11 reveals a 407, overall 
incidence of reading disability in this sample [(25 + 36)/15l]and a 16% 
incidence in the Severe group (25/151). The higher overall incidence in this 
sample may simply reflect sampling bias due to the smaller N. However, the 
percentage of Severe cases (16%) was identical to the standardization group 
(Grade 2) for the co mbined reading criterion (Table 5) . 

The predictive hits against this additional criterion can also seen in 
Table LI. For valid positives, the tests correctly predicted L9 of the 25 
S evere cases (V p = 76%) and L9 of the 36 Mild cases (V p « 53%). For valid 
negatives, the tests correctly predicted 44 of the 60 Average readers (V n -73%) 

and 29 of the 30 Superio - readers (V n = 97% ) • In other words » tl " tests 
correctly predicted 111 (19 + 19 + 44 + 29) of the 151 Ss or 74% of the cross- 
validation sample. Again, the accuracy was largely confined to the extreme 
groups ( Severe and Su perior ) . This hit-rate of 74%, based on the lambda 
weights from a different standardization group and with a test-criterion in- 
terval of nearly three years, provides additional support for the predictive 
validity of these kindergarten measures. In fact, the shrinkage in this cross- 
validation analysis was only two percent when compared to the hit-rate for the 
third year standardization follow-up, using this combined criterion (Table 5). 

For purposes of brevity, the stepwise regression analysis (full battery) r«vea 
essentially the same discriminative ranking of tests for this combined read- 
ing criterion as for Classroom Reading Level (Table 10). The Finger Local- 
ization Test (Factor 1) again ranked highest followed cumulatively by the 
Embedded F igures Test (Factor I), WISC Similarities Subtest (Factor III) and 
Socio-economic Level (Factor II). The only change in this ranking was the 
ubstitution of Socio-economic Level for the Finger Tapping Test. However, 
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the discriminative ranking of tests changed somewhat between the standard- 
ization group and the cross-validation group analyse*. This change may well 
reflect sampling differences between the two groups. Nevertheless, the Finger 
Localization Test (Factor I) retained its primary ranking across follow-up 
years and groups lending further support to the validity of this factor as a 
precursor to later reading disability. 

CONCLUDING REMARKS 

The preceding discussion has focused on the predictive accuracy of a 
developmental-neuropsychological test battery which was given to a total pop- 
ulation of white boys during kindergarten (1970) in order to forecast their 
reading achievement in subsequent years. This was accomplished by assessing 
reading in this population independently at the end of Grades 1, 2 and 3 ( 1972- 
1974). The results, based on systematic evaluation at yearly follow-up per- 
iods, revealed that performance on these ttests during kindergarten was ex- 
tremely predictive of the child's reading group membership in later grades-- 
particularly those children destined to become Severely Disabled or Superior 
readers. Also, it was shown that this accuracy was largely accounted for by 
a small number of tests which consistently ranked highest in predictive dis- 
crimlnability. These latter tests represented measures of sensori-perceptuat 
motor-mnemonic skills which were postulated to be in rapid development during 
preschool years and crucial to the early phases of reading. It was also 
shown that these predictive tests could be applied in educational settings as 
detection procedures for early intervention decisions. A utility matrix was 
presented to illustrate this concept of statistical decision theory. Finally, 
it was shown that the test weights derived from the standardization popula- 
tion (K-G2) held up under cross-validation for a separate group of children 
who were followed for three years (K-G2) . 
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These findings, in summary, lend substantial support to the validity 
and utility of an early detection or 'warning system 1 that could be admin- 
istered economically before the child begins formal reading--at a time when 
his central nervous system may be more plastic and responsive to change-- 
and at a time when he it less subject to the shattering effects of repeated 
academic failure. One of the real tradegies in our educational system has 
been the tendency to institute the intervention program only after the read- 
ing disability has become clinically manifest, at which time secondary be- 
havioral and psychological problems are often confounded with the underlying 
handicap (de Hirsch and Jansky, 1966). It may be that these secondary prob- 
lems serve to exacerbate if not reinforce the vicious cycle of frustration 
and failure for the child. 

This problem of delayed intervention is probably based on multiple fac- 
tors including limited diagnostic and treatment resources and, sometimes, 
the naively optimistic view that the slow starter will eventually " catch up' 1 
if left alone. The incidence changes across years in the present study 
should provide a sobering contrast to this view. The incidence of overall 
reading disability has continued to increase yearly in our longitudinal pop- 
ulation despite the increasing intervention efforts of the schools during 
this same time period (reference Figure 1). Furthermore, although the in- 
cidence of severe reading disability seems to have plateaucd between Grades 
2 and 3, it has never decreased—again, despite the remedial programs for 
these children in the schools. 

The high predictive accuracy of the test battery (standard and abbrevi- 
ated) has two interesting implications concerning the nature of this dis- 
order—namely, dyslexia. First, the fact that these kindergarten tests as- 
sessed a variety of developmental skills vrhich were shown to forecast sub- 
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sequent reading competency suggests that the disorder is the consequence of 
antecedent events which are present in the child as early as kindergarten-- 
and most probably before. Second, the fact that these antecedent factors 
comprised nonreading skills suggests that dyslexia is more than a reading 
disorder per se. This latter finding is most compatible with those theories 
that conceptualize dyslexia as a disorder in central processing (Chalfont and 
Scheffelin, 1969). 

One instructive point to be drawn from these comments is that faulty 
methods of instruction in the schools cannot be singled out as the primary 
cause of reading failure today (Fufcth, 1972). The results of the present 
study indicate that there are precursors which are reflected in the behavior 
of the child- -at least as early as kindergarten- -which forecast reading 
competency before formal reading instruction or remediation is commenced. 
This statement, however, is not meant to exclude an interaction between the 
child's readiness and the type of instruction involved* Certainly, faulty 
instruction will serve to intensify the problems of a high risk child. This 
point has never been disputed. 

But what are these precursors or antecedents that forecast future 
reading competency? In the present context, these precursors seemed to 
reflect a general developmental readiness in perception, cognition, lan- 
guage and memory. There was no one particularly striking function or skill 
area that stood out. For those children destined to become severely 
disabled readers in four years, their performance during kindergarten was 
depressed or delayed on almost all of the developmental tests—especially on 
the most discriminative predictors (i.e., Finger Localization, Recognition- 
Discrimination, Alphabet Recitation). In this respect, the precursors 
suggest some type of developmental unreadiness or immaturity. For example, 
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during kindergarten assessment, this group of severely high risk children 
could only recite parts of the alphabet, their visual-motor integration on 
the Beery was approximately 10 months behind their chronological age* 
Similar delays in somatosensory integration, perceptual discrimination, 
auditory discrimination and verbal-cognitive processing also characterized 
the performance of these severe high risk children. 

By contrast, those children destined to become superior readers in 
three or four years were advanced on almost all of the developmental tests 
given in kindergarten. This comparison can be quickly visualized by 
inspection of Table 12 which presents the means for each of the tests in the 
Abbreviated Battery for each of the reading groups at the end of Grade 2. 
The potential superior readers were advanced in terms of visual -motor , 
perceptual, somatosensory and language development- -and they were intellecr 
tually much brighter (IQ = 113 vs. 92). In fact, those children destined 
to superior reading levels in four years were advanced developmental ly 
whether compared within Ss (to their chronological age) or between Ss (to 
othei roading groups). In fact, Table 12 shows that as one moved from the 
Severe group to the Mild , Average and Superior reading groups, that overall 
performance increased in linear additive increments. There was no special 
pattern (non linear) that characterized any of the groups except an increas- 
ing developmental integration in perception, cognition, language and memory 
as one proceeded from the high to the low risk groups* 

This latter finding should dispel any premature attempts to explain spe- 
cific reading disability as a primary lag or defect in one particular 
modality or function (e.g., auditory sequencing or memory, etc,)* The pro- 
blem is unfortunately not that simple, particularly when viewed within a 
longitudinal framework. Yet, considerable controversy continues to exist 
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concerning the nature of the disorder- -namely , dyslexia and it's etiology. 
All too often these terms are used interchangeably or are poorly defined. 
Similar problems could occur in the present study if one were to treat 
the developmental precursors as causal antecedent events. At best, the 
precursors represent phenotypic behavioral expressions of events intrinsic 
to the child which are probably mediated by more basic brain mechanisms. 
Hence, the elusive and complex nature of the problem. 

At present, the concept of a developmental readiness seems to repre- 
sent the most parsimonious and least pernicious way to describe these 
behavioral differences (precursors) that characterize children destined in 
later years to extremes in the reading distribution. This concept, which 
has long been proposed by others (de Hirsch and Jansky, 1966; Ciitchley, 
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1968; Money, 1962) Is certainly compatible with the theory proposed in this 
paper (Satz and Van Nostrand, 1973) . However, the present results do not 
permit a test of the mechanism which is postulated to underlie this dis- 
order- -namely, a lag in the maturation of the cerebral cortex (Satz and 
Van Nostrand, 1973), This formulation must still be treated as a hypothe- 
tical construct, the mechanism of which is neither observable or directly 
measurable at the present time. However, the formulation has some 
heuristic value in that it postulates that this lag mechanism retards the 
acquisition of those developmental skills which are in primary ascendancy 
during preschool and which are crucial to the early phases of reading. 
This statement, which is conceptualized in developmental -behavioral terms, 
is subject to more direct test and evaluation. The theory is also explicit 
with respect to this concept of a developmental lag. First, it states that 
the precursors of this disorder can best be explained in developmental 
terms, without recourse to terms such as damage, impairment or dysfunction. 
Second, that this lag in brain maturation will delay the acquisition of 
the different stages of thought that unfold during the developmental 
process. This latter position is critical to the second objective in our 
longitudinal project --namely , developmental changes in this disorder over 
time. However, not enough time has elapsed in the project to determine 
the nature of these developmental changes, if any, from ages 5-11, The 
theory nevertheless states that this lag in brain maturation (genetic or 
otherwise) will delay the acquisition of the developmental stages in 
thought for the dyslexic child. In this context, the disabled reader is 
expected to be constantly lagging behind his normal age control. In 
younger years, the pattern will involve those skills which are in primary 
ascendancy at that age (e.g., perceptual, cross-modal integration). 
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In later years the pattern will change as the dyslexic child eventually 
M catches up M on the earlier developing skills--now only to be lagging on 
those crucial later-developing cognitive language skills which unfold 
from the preceding stages of concrete perceptual operations . If the 
dyslexic child fails to M catch up" on these more cognitive- language skills 
which develop in later childhood- -when the brain is reaching full matura- 
tion-then more permanent delays in language and reading skills are 
predicted (Satz and Van Nostrand, 1973). 

Evaluation of this part of the theory will not be completed until the 
children reach 11 years of age in 1976 (Grade 5) * The test of a develop- 
mental lag hypothesis, however, will be possible because of the repeat ad- 
ministration or the tests at three year intervals since kindergarten 
(K, G2, and G5) . Preliminary evaluation between grades K and G2 (Satz, 
Friel and Rudegeair, 1974) provided some initial support for this hypothe- 
sis of delayed acquisition. Only time will determine whether these 
preliminary findings are replicable. If sc, they should provide additional 
knowledge concerning the precursors and developmental course of this 
disorder over time. This should bring us one step closer to the major 
questions of etiology and prognosis. These questions can and must be 
answered if we are to prevent or reduce tlie large number of human casual- 
ties that needlessly result when academic problems begin so early in life. 
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Footnotes : 

* Research supported in part by funds from The National Institute of Mental 
Health, Behavioral Sciences Research Branch (MH 19415). 

" Nineteen variables derived from the kindergarten battery were utilized in 
these studies. A detailed description of each variable is reported else- 
where (Satz and Friel, 1973). They consist of (1) Da^ of Testing (2) A ge , 
(3) Handedness , (4) Fing er Tapping Total , (5) Peabody Picture Vocabulary 
Test , (6) Recognition - Discrimination, (7) Embedded Figures ,. (8) Verbal 
Fluency , (9) Beery Developmenta 1 Test of Visual - Motor Integration , (10) 
WISC SimilarUles Subtest , (11) Alphabet Recitation , (12) Right - Left Dis - 
crimination , (13) Finger Localization , (14) Auditory-Discriminat ion , (15) 
Dichotlc Listening , Right Channel Recall , (16) Dlchotic Listening , Left 
Channel Recall , (17) Auditory - Visual Integration , (18) Behavioral Check - 
list and (19) Socio-economic Status . 



J A preliminary one year follow-up to the end of kindergarten is reported 
in a separate study (Satz and Friel, 1973)* 

4 This hit-rate was based on a standard computer generated cutting line 
(Satz, 1966); however, the hit-rate increased to 91% when an adjusted 
cutting line was employed (reference Satz and Friel, 1974), 

Day of Testing , which also loaded on Factor I, was excluded as a factorial 
variable because of its low correlation coefficient with this factor 
(r £ .37). 

Fewer Ss were included in this combined criterion than the Classroom 
Reading Level group since the IOTA was administered to only those Ss who 
still resided in Florida or Georgia at the end of Grade 2, whereas the 
teacher questionnaires were distributed throughout the USA and abroad. 

^ A 34% overall incidence of reading disability, using this more refined 
composite reading criterion, is approximately what would be expected for 
a male population. If the national incidence is approximately 20% with a 
sex ratio of 4:1 in favor of bpys, then the incidence should increase to 
32% in a male population. 

8 The abbreviated battery comprised the following: (1) Peabody Picture Vo- 
cabulary Test . (2) Recognition-Discrimination Test, (3) Beery Develop- 
mental Test of Visual-Motor Integration. (4) Alphabet Recitation Test . 
(5) Finger Localization Test . (6) Auditory - User imlnat ion Test. (7) 
Dichotic Listening Test ( Total Jl arid L. Recall) and (8) Socio - economic status « 

Q 

This figure (N * 459) is almost identical to the available population in 
the third year follow-up (N » 458). 

^ Similar discrimination between groups was observed for the abbreviated 
battery (N- 8) in this four year follow-up. 
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The slight discrepancy in total hit-rate between DSCRIM and BMD07M is 
believed to reflect computational differences between the two programs. 
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Table 1 



Predictive Classification of Ss into Criterion Reading 
Groups ( 1972) a based on Discriminant Function 
Composite Scores (1970) 



Composite 
Test 
Scores 



N 

Z 



Severe 
18 
(100) 



Criterion Groups 



Mild 
39 
(71) 



Averag e 
55 
(16) 



Superior 
3 

(5) 



(0) 



16 



(29) 



284 



(84) 



58 



(95) 



18 



55 



339 



61 



Criterion = Classroom Reading Level (End of Grade 1) 
(N T » 473). 



ERIC 



42 



Table 2 

Discriminative Ranking and Cumulative Classification 
of Tests by Factor Loadings based on a 
Discriminant Function Composite Score 



Ranked Factor Cumulative 

Variables % Correct 



1 . V i nger 

Localization I 76.1 

2. Recognition- 
Discrimination I 77.2 

3. Day of Testing 79.1 

4. Alphabet Recitation I 81.6 

5. Residual Tests I-TV 84.4 



a 



Criterion = Classroom Reading Level (end of Grade 1) . 
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Table 3 



Predictive Classification of Children into Criterion 
[leading Groups ( 1973 ) a based on Discriminant 
Function Composite Scores (1970) 

Composite Criterion Groups 
Test 

Scores^ 

Severe Mild Average Superior 

N 48 47 74 4 

+ 

% (89) (71) (27) (6) 

N 6 19 196 64 

7. (11) (29) (73) (94) 



T 54 66 270 68 



a Criterion = Classroom Reading Level (N T ■ 458). 

b Program ■ DSCRIM, Multiple Discriminant Analysis (4 group predictions 
collapsed), prior probabilities set equal. 



s 
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Table 4 



Discriminative Ranking and Cumulative Classification of 
Tests by Factor Loading based on Stepwisg 
Discriminant Function Composite Scores 9 



Ranked 
Variables 



Factor 



Cumulative 
7o Correct 



L. Finger 

Localization 

2. Alphabet Recitation 

3. Recognition- 
Discrimination 

4. Day of Testing 

5. Residual Tests 



I 
I 



I- IV 



70,5 
75*5 

77.3 
76,9 
77.7 



a Criterion = Classroom Reading Level 

b Program = BMD07M, Stepwise Discriminant Analysis (4 group), 



prior probabilities set equal. 
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Table 5 

Predictive Classification of Children into Criterion 
Reading Groups (1973) a based on Discriminant 
Function Composite Scores (1970) 



Composite 
Test 
Scores 



N 

+ 

% 



N 

X 



T 67 77 214 61 



a Criterion = Classroom Reading Level & IOTA Word Recognition 
(N = 419). 

b Program = DSCRIM, Multiple Discriminant Analysis (4 group 
predictions collapsed), prior probabilities set equal. 



Criterion Groups 



Severe Mild Average Superior 

61 51 68 2 

(91) (66) (32) (3) 

6 26 146 59 

(9) (34) (68) (97) 
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Table 6 



Probability of Decision Risk (Treatment /No Treatment) 
associated with Differential Composite Test Score 



Predictions to Third Year Reading Level 



a 



Composite 



Test 
Scores b 


Decision 


Severe 


Criterion Groups 
Mild Average 


Superior 


Ratio 
Correct 


£ 


+ + 


T 


50 


18 


15 


0 


68/83 


.82 


+ 


T 


11 


33 


53 


2 


44/99 


.44 




NT 


5 


23 


84 


11 


95/123 


.77 




NT 


1 


3 


62 


48 


110/114 


.96 



Total 67 77 214 61 

Criterion = Classroom Reading Level & IOTA Word Recognition (N T - 419). 

Program - DSCRIM, Multiple Discriminant Analysis (4 group), prior 
probabilities set equal. 
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Table 7 



Predictive Classification of Children into Criterion 
Reading Groups (1974) a baser on Discriminant 
Function Composite Scores (1970) 



Composite 
Test . 
Scores 



N 



7 



Severe 
50 
(91) 



Criterion Groups 



Mild 
64 
(69) 



Average 
71 

(30) 



Superior 
5 

(06) 



N 

7. 



5 

(09) 



29 
(31) 



163 

(70) 



72 
(94) 



55 



93 



234 



77 



Criterion ■ Classroom Reading Level (N = 459). 

Program = DSCRIM, Multiple Discriminant Analysis (4 group predictions 
collapsed), prior probabilities set equal. 
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Table 8 

Discriminative Ranking and Cumulative Classification 
of Tests by Factor Loadings based on StegwjLse 
Discriminant Function Composite Score ' 



Ranked Factor Cumulative 

Variables % Correct 



1. Finger 

Localization I 70.4 

2. Alphabet 

Recitation I 74.1 

3. Recognition- 
Discrimination I 76.5 

4. Dichotic 

Listening (Total) I 77.0 

5. Residual Tests I-IV 78.2 



Criterion = Classroom Reading Level 

Program = BMD07M, Stepwise Discriminant Analysis 
(4 group), prior probabilities set equal. 
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Table 9 



Predictive Classification of Cross-Validation Sample 
into Criterion Reading Groups (Grade 2, 1974) a 
based on Discriminant Function Weights of 
Standardization Population (1970-1973) 



Test . Criterion Groups 

b 



Composite 
Test 
Scores 



Severe Mild Average Superior 

N 16 9 28 3 

+ 

% (89) (36) (30) (7) 

N 2 16 64 37 

7- (ID (64) (70) (93) 



T 18 25 92 40 

a Criterion = Classroom Reading Level (N = 175) 
b Abbreviated Test Battery 
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Table 10 

Discriminative Ranking and Cumulative Classification 
of Tests by Factor Loadings based on Discriminant 
Function Composite Score a ' 



Ranked Factor Cumulative 

Variables % Correct 



1. Finger 

Localization I 66,9 

2. Embedded 

Figures I 68.2 

3. Similarities III 70.9 

4. Finger 

Tapping (Total) IV 71.5 

5. Residual Tests I-IV 77.5 



Criterion = Classroom Reading Level (N T = 175). 

b Program * BMD07M, Stepwise Discriminant Analysis 
(4 group), prior probabilities set equal. 
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Table 11 



Composite 
Test 
Srores b 



Predictive Classification of Cross-Validation Sample 
into Criterion Reading Groups (Grade 2, 1974) a 
based on Discriminant Function Weights of 
Standardization Population (1970-1973) 

Criterion Groups 



N 



Severe 
19 
(76) 



Mild 
19 

(53) 



Average 
16 

(27) 



Superior 
1 

(3) 



N 

% 



6 

(24) 



17 

(47) 



44 



(73) 



29 



(97) 



25 



36 



60 



30 



a 



Criterion = Classroom Reading Level and IOTA Word Recognition 
(N = 151). 

b Abbreviated Test Battery 
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Table 12 

Means and Standard Diviations of Tests (Abbreviated) 
Battery, Grade K) for Reading Groups (Combined 
Criterion, Grade 2) 

Criterion Groups 

Severe Mild Average Superior 

N=67 N=77 N=214 N=61 

Age a = 6 5 . 9 Age=65.6 Age- 66. 4 Age=67.0 



Finger 

Localization 


26.2 
(7.1) 


31.1 
(8.0) 


35.9 
(7.6) 


39.5 
(5.2) 


Alphabet 
Recitation 


14. 0 
(8.8) 


17.5 
(8.5) 


22.2 
(6.8) 


25.3 
(6.9) 


Recognition- 
Discrimination 


6.9 
(2.9) 


8.9 
(3.0) 


9.9 
(2.6) 


11.6 
(2.0) 


Peabody 


92.0 
(16.5) 


100.2 
(12.3) 


106. A 
(14.9) 


113.1 
(12.1) 


Beery b 


56.3 
(7.5) 


60.7 
(8.1) 


66.5 
(9.3) 


71.4 
(10.8) 


Auditory- 
Discrimination 


1.1 

(.5) 


1.1 

(.5) 


1.3 
(.4) 


1.5 
(.3) 


Dichotic 

Listening (Total) 


57.6 
(13.7) 


63.0 
(14.5) 


68.1 
(14.2) 


76.4 
(13.9) 


Socio-economic 
Status 


1.7 
(.4) 


1.8 

(.4) 


1.9 

(.3) 


2.0 
(0.0) 



Age in Months 
Score in Months 
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Grades/Years 



F'gure 1, Follow-up changes In Incidence ' Overall 
Reading Disability (Severe and Mild) and °evere Read- 
ing Disability in population from end of Grades K - 3 
(1971 - 1974)* 
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